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Methods of OBTAiNiNG isoFORM sPECiFic expression in mammalian cells V 



FIELD OF INVENTION 

The present invention relates to the field of gene expression, in particular to 
5 inhibiting the expression of a specific isofonn of a gene in mammalian cells using 
double-stranded RNA and to expressing a specific isoform in a clean back 
ground. The double-stranded RNA based technology of the invention has wide 
applications, such as for determining function of a particular gene isoform or 
developing therapeutic methods for treating diseases. 

10 

BACKGROUND OF THE INVENTION 

Many eukaryotic genes are expressed as multiple isoforms through the 
differential utilization of transcription and translation initiation sites or through 

IS alternative splicing, thus giving rise to proteins of related sequence but with 
biochemically as well as biologically distinct features (Andreadis et al., 1987, 
Annu. Rev. Cell Biol. 3, 207-242). Although, In many cases, multiple isoforms are 
expressed in the same cell at the same time, the expression level and pattern of 
each isoform may vary with cell type and developmental stage, making the study 

20 of each isoform confusing and difficult. 

The conventional approach for investigating the function of individual isoforms 
requires conditions where only one Isoform is expressed or eliminated to provide 
a clean background (i.e. without the influence of other Isoforms). Thus, typically 

25 the desired isoform has been expressed in cells or whole organisms in which the 
target gene has been deleted, a process that is time consuming. While several 
protocols for tissue-specific expression or elimination of a gene of interest have 
been developed (Johnson et al., 1989, Mol. Cell Biol. 9, 3393-3399; Tan, 1991, 
Dev. Biol. 146, 24-37; Gorman and Bullock, 2000, Curr. Opin. Biotechnol. 11, 

30 455-460), reports of isoform-specific gene inactivation (Migliaccio et al., 1999, 
Nature 402, 309-313) or expression in a clean background are limited. Such 
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protocols are usually lengthy, often requiring several months to establish the 
desired conditions. During this time, cells or organisms may adapt to the new 
conditions (Muller, U., 1999, Mech. Dev. 82, 3-21) and caution is required when 
interpreting the results. 

5 

In the last few years, advances in nucleic acid chemistry and gene transfer have 
inspired new approaches to engineer specific inhibition of gene expression. 
Antisense technology has been the most commonly described approach In 
protocols to achieve gene-specific inhibition, although more recently targeted 

10 gene silencing based on double-stranded RNA (dsRNA) inducing a response 

called post-transcriptional gene silencing (PTGS) or RNA interference (RNAI) has 
been proposed. PTGS is a phenomenon originally reported in plants (van der 
Krol et aL, 1990, Plant Cell 2, 291-299; Napoli, et ah, 1990, Plant Cell 2, 931- 
943), where introduction of a transgene causes silencing of the endogenous 

15 homologous gene as well as itself. The mechanism of PTGS involves enhanced 
mRNA degradation with double-stranded (ds)RNA as the trigger (Cogoni and 
Macino, 2000, Curr. Opin. Genet. Dev. 10, 638-643; Carthew, 2001, Curr. Opin. 
Cell Biol. 13, 244-248). A similar phenomenon (quelling) has been observed in 
Neurospora (Cogoni et al., 1994, Antonie Van Leeuwenhoek 65, 205-209). The 

20 mechanism underlying RNAi has been partially elucidated and a 21- to 23-nt-long 
dsRNA was found to be the intermediate/mediator of mRNA decay (Zamore et 
al., 2000, Cell 101 , 25-33; Bernstein et al., 2001 , Nature 409, 363-366). 

In the animal kingdom, dsRNA-mediated gene silencing was first described In the 
25 nematode C. elegans (Fire et ai., 1998, Nature 391 , 806-81 1) and was termed 
RNA interference (RNAi). Subsequently, RNAi has been observed in a wide 
range of organisms including flies, trypanosomes, hydra, zebrafish and mice 
oocytes (Cogoni and Macino, 2000; Bosher and Labouesse, 2000, Nat. Cell Biol. 
2, E31-36). Although RNAi has been used extensively in non-mammalian 
30 systems, dsRNA inhibition has only recently been applied successfully in 

mammalian systems. For example, an efficient posttranscriptional gene-silencing 
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method has been reported that employs a small Interfering double-stranded RNA 
(sIRNA) In mammalian cells (Elbashir et al., 2001, Nature 41 1 , 494-498). The 
authors describe that transfectfon of 21 -nucleotide dsRNA (sIRNA) can trigger 
PTGS of both the co-transfected and the endogenous gene In cultured 
5 mammalian cells. 

Using a cell-free system of dsRNA-medlated mRNA decay, mRNA was shown to 
be cleaved in Drosophila embryonal cell extracts within the region of identity with 
the dsRNA (Zamore et al., 2000). However, the potency of RNAI In womris and 

10 flies suggeste that dsRNA acts catalytically and/or is amplified (Zamore, 2001 , 
Nature Structural Biology 8, 746-750). Although it has been suggested that 
sIRNAs are not replicated by endogenous RNA-dependent RNA polymerase in 
C. elegans, data now exists supporting a model whereby siRNAs indeed act as 
primers for cellular polymerases and are extended to form ds RNAs in both 

15 C.elegans and Drosophila (Lipardi et al., 2001 , Cell 1 07, 297-307; Sij'en et al., 
2001, Cell 107, 465-476; Nishikura, 2001, Cell 107, 415-418). Propagation of 
RNAi to other regions of the mRNA in this way would result in the spread of gene 
silencing to related sequences, thereby not allowing isoform-specific dsRNA 
inhibition. 

20 

There is a need in the art for a system to evaluate gene function, in particular for 
the evaluation of isoform function in mammalian cells, as well as for therapies 
dependent on dsRNA inhibition of closely related mRNAs in mammalian cells and 
this invention meets that need. 

25 
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SUMMARY OF THE INVENTION 

10 

The present invention provides a method of expressing a desired Isoform of a 
gene product in a cell absent undesired isoforms of the gene product, comprising 
the steps of: 

(a) exposing a mammalian cell to at least one nucleic acid that comprises at least 
15 a partially double-stranded ribonucleic acid, the double-stranded portion having 

at least 95% sequence identity, preferably 100% sequence identity, to a common 
nucleic acid sequence shared by two or more isoforms of the gene product; and 

(b) introducing an expression vector encoding a desired isoform of the gene 
product Into the mammalian cell, the nucleic acid encoding the desired isoform 

20 having a sequence comprising one or more mismatches relative to the double- 
stranded portion of the nucleic acid, operably linked to a promoter capable of 
driving expression of the desired isoform in the cell. Preferably, the common 
nucleic acid sequence is at least 19 consecutive nucleotides in length and is 
common to all endogenous isoforms of the gene product in the cell of interest. 

25 

The nucleic acid is preferably 19 to 25 nucleotides long and the double-stranded 
portion of the nucleic acid is preferably 100% Identical to the common nucleic 
acid sequence. In particular, the at least partially double-stranded ribonucleic 
acid preferably contains a double-stranded portion of at least 19 nucleotides, 
30 more preferably of 19 nucleotides, and at least one single-stranded 3' overhang 



wo 03/062414 



7 



PCT/EP03/00611 



of two-nucleotides, preferably both upper and lower strands having a single- 
stranded 3' overhang of two-nucleotides, 

The desired isoform is encoded by a sequence comprising at least one mismatch 
S relative to the double-stranded portion of the nucleic acid to ensure that the 
nucleic acid of at least partially double stranded RNA character does not affect 
expression of the desired isoform. The desired isoform can therefore be encoded 
by a sequence having one, two or more mismatches relative to the double- 
stranded portion of the nucleic acid as long as the altered coding sequence still 
10 encodes the desired product. Thus, at least one codon (e.g., two or more) 

encoding the desired isoform may differ from the endogenously used codon at 
the same position, it Is well known in the art that many amino acids are 
designated by more than one triplet codon. Preferably, the desired isoform has 
an identical protein sequence to the corresponding endogenous isoform (unless 
15 an endogenous wild-type sequence is used to correct an endogenous mutant 
isoform, in which case the wild-type sequence is used), although amino acid 
substitutions, deletions or additions to the isoform sequence may be tolerated, in 
particular conservative substitutions. 

20 In one embodiment, the desired isoform replaces a mutant isofonm expressed by 
the cell, allowing correct gene function. The mutant isoform can be oncogenic, 
apoptotic, tumor suppressive, inflammation Inducive or suppressive, or 
angiogenic, for example. 

25 In a further embodiment, the method of the invention is used to determine the 
function of the desired isoform (i.e, isoform of unknown function). 



30 



The method is useful in all types of mammalian cells. Including cancer cells, such 
as HeLa (cervical cancer), PCS (prostate cancer), MDA-MB-231 (breast cancer) 
and MCF-7 (breast cancer), or cells in vivo. 
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The desired fsoform can be transcribed under the control of any promoter, such 
as an endogenous promoter, a constitutive promoter, an inducible promoter or a 
tissue-specific promoter. 

S The invention also provides methods of assigning function to a desired isoform of 
unknown (or unrecognized) function. Also provided are kits and materials for 
carrying out the Invention, as well as cells exhibiting isoform-specific expression 
in a clean genetic background. The invention also encompasses therapeutic uses 
and compositions based on the methods of the Invention. 

10 

DETAILED DESCRIPTION OF THE INVENTION 

In its broadest aspect, the present invention provides a method of expressing a 
desired isoform of a gene product in a cell absent undesired isoforms of a gene 

15 product, by exposing a mammalian cell to at least one nucleic acid, the nucleic 
acid being at least a partially double-stranded ribonucleic acid and the double- 
stranded portion having at least 95% sequence identity to a common nucleic acid 
sequence shared by two or more isoforms of the gene product; and introducing 
an expression vector encoding a desired Isoform of the gene product into the 

20 mammalian cell, the desired isoform having a sequence comprising one or more 
mismatches relative to the double-stranded portion of the nucleic acid, operably 
linked to a promoter capable of driving expression of the desired isoform in said 
cell. 

25 The nucleic acid will typically be a ribonucleic acid (RNA) that in double-stranded 
form has at least 95%, preferably 98% most preferably 100% sequence Identity 
to a common nucleic acid sequence shared by two or more isoforms of the gene 
product. Preferred RNA molecules for inhibition comprise sequences Identical to 
a common nucleic acid sequence shared by two or more isoforms of the gene 

30 product over at least 15 -25 consecutive bases, preferably over at least 19 

consecutive bases. Preferably the common nucleic acid sequence Is common to 
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all endogenous isoforms of the gene product expressed by the cell of interest 
The RNA sequence is preferably chosen to have identity with exon sequences. 
Gene expression Is inhibited in a sequence-specific manner in that nucleotide 
sequences corresponding to the duplex region of the RNA are targeted for 
5 inhibition. 

Sequence identity (and therefore specificity) may be optimized and common 
nucleic acid sequence determined by sequence comparison and alignment 
algorithms known in the art (see Gribskov and Devereux, Sequence Analysis 

10 Primer, Stockton Press, 1991 , and references cited therein) and calculating the 
percent difference between the nucleotide sequences by, for example, the Smith- 
Waterman algorithm as implemented in the BESTFIT software program using 
default parameters (e.g., University of Wisconsin Genetic Computing Group). 
Sequence specificity is important to ensure that the nucleic acid does not cross- 

15 react and affect expression of other unrelated gene sequences. 

The nucleic acid will typically be relatively short for use in mammalian cells to 
allow efficient delivery of the nucleic acid to the cell and also to avoid any protein 
kinase R (PKR) response of the cell. Alternatively, other ways of circumventing 

20 non-specific cellular responses to dsRNA or more efficient delivery methods can 
be used. Thus, the nucleic acid is at least partially double-stranded RNA in 
character and will typically be 18 to 25 nucleotides long. The double-stranded 
RNA character is provided by sequences at least 18 nucleotides long, preferably 
19 nucleotides in length, which are preferably 100% identical to the common 

25 nucleic acid sequence shared by two or more isoforms of the desired gene. In 
preferred embodiments, the nucleic acid comprises at least one single-stranded 
3' overhang of two-nucleotides, preferably both upper and lower strands having a 
single-stranded 3' overhang of two-nucleotldes. 



30 



The at least partially double-stranded RNA can be formed by a self- 
complementary RNA strand (such as a transcript having an inverted repeat), or 
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10 

by two or more complementary RNA strands. RNA duplex formation may be 
Initiated either Inside or outside the cell. The RNA is Introduced In an amount that 
allows delivery of at least one copy per cell, preferably at least 5, 10, 100, 500 or 
1000 copies per cell, depending on the application. The amount Introduced is 
5 dependent on the desired effect and can be easily detennrtined empirically. 

The nucleic acid may comprise nucleotides or linkages other than those that 
occur naturally in ribonucleic acid, for example, to stabilize the dsRNA from 
degradation, especially wiien RNA is delivered to a cell and not produced by the 

10 cell. Thus, one can employ oligoribonucleotides or oligonucleotides that comprise 
one or more modified (i.e., synthetic or non-naturally occurring) nucleotides. 
Usually, nucleotide monomers in a nucleic acid are linked by phosphodiester 
bonds or analogues thereof. Analogues of phospliodiester linkages include 
. phosphorothioate, phosphorodithioate, phosphoroselenoate, 

15 phosphorodiselenoate, phosphoroanilothioate, phosphoranilidate, 

phosphoramidate, peptide, and the like linkages. Those of skill in the art will 
recognize that the reagents employed are commercially available or. In the case 
of the oligonucleotides, can be prepared using commercially available 
instmmentatlon. Preferably the duplex RNA will comprise ribonucleotide units or 

20 other nucleotide units that allow appropriate processing by the cell and efficient 
Inhibition of the Isofomns expressed by the cell. The nucleic acid (typically RNA) 
may be synthesized either in vivo or in vitro, particuiariy in vitro when short 
sequences are employed. Endogenous RNA polymerase of the cell may mediate 
transcription in vivo, or cloned RNA polymerase can be used for transcription in 

25 vivo or in vitro, essentially as described below. 

Thus, in one step of the method of the Invention a mammalian cell is exposed to 
the nucleic acid described above to inhibit expression of specific isoforms 
expressed in a cell of interest, preferably to inhibit ail related isoforms. The term 
30 "isofomri" is well known in the art and is meant to encompass gene products that 
are produced as a result of differential gene splicing as well as from the use of 
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alternative transcription and translation start sites. In addition, for the purpose of 
the present Invention, the term isoforms include any closely related sequences 
and therefore may include a mutated gene in a cell, such as deleterious point 
mutations and the like. The mutant isoform can be oncogenic, apoptotic, tumor 

5 suppressive, inflammation inducive or suppressive, or angiogenic, for example, 
and its deleterious function corrected by the method of the invention. Although 
the Examples below illustrate the Invention using ShcA, any gene product 
produced in multiple forms can be targeted using the methods of the present 
invention. For example, such proteins may be therapeutically important proteins, 

10 such as enzymes, e.g., kinases (PKB), ras, integrins, E2F, Rb, PGR, other 
signaling molecules and transcription factors. 

The effect of dsRNA on gene expression will typically result in expression of the 
target isoforms being inhibited by at least 10%, 33%, 50%, 90%, 95% or 99% or 
15 more when compared to a cell not treated by this step. 

The mammalian cell can be any cell of interest. Thus, the cell may be cells from 
the inner cell mass, extraembryonic ectoderm or embryonic stem cells, totipotent 
or pluripotent, dividing or non-dividing, parenchyma or epithelium, immortalized 

20 or transformed, or the like. The cell may be a stem cell or a differentiated cell. 
Cell types that are differentiated include without limitation adipocytes, fibroblasts, 
myocytes, cardiomyocytes, endothelium, dendritic cells, neurons, glia, mast cells, 
blood cells and leukocytes (e.g., erythrocytes, megakaryotes, lymphocytes, such 
as B, T and natural killer cells, macrophages, neutrophils, eosinophils, basophils, 

25 platelets, granulocytes), epithelial cells, keratlnocytes, chondrocytes, osteoblasts, 
osteoclasts, hepatocytes, and cells of the endocrine or exocrine glands, as well 
as sensory cells. Preferably for the study of biological function of a particular 
isoform or for studying protein-protein interactions, cells that are easily cultured 
are preferred. These may Include cancer cells, for example, Including HeLa 

30 (cen/ical cancer), PCS (prostate cancer), MDA-MB-231 (breast cancer) and MCF- 
7 (breast cancer) cells. It will be apparent to one of ordinary skill in the art that the 
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teachings of the specification can easiiy be applied to other situations, such as 
cancer cells for the replacement of a mutated "isoform" with one or more desired 
isoform(s). 

S In a further step, an expression vector encoding a desired isoform of the gene 
product is introduced into the mammalian cell, the desired isoform having a 
sequence comprising one or more mismatches relative to the double-stranded 
portion of the nucleic acid, operably iinlced to a promoter capable of driving 
expression of the desired isoform in said cell. The nucleic acid encoding the 

10 desired isoform preferably comprises a sequence comprising one or more 
mismatches, preferably two or more, relative to the double-stranded portion of 
the nucleic acid used to inhibit expression of the endogenous isoforms to ensure 
that the nucleic acid (RNAI) does not inhibit expression of the desired isoform. 
Preferably, the expression vector encodes a desired isoform using at least one 

15 codon, more preferably at least two codons, that differ(s) from the endogenous 
sequence coding the corresponding isoform. Changes in protein sequence 
between the endogenous isoform and desired isoform are also encompassed by 
the present invention, in particular those substitutions, deletion or additions that 
do not affect gene function, which are preferably conservative substitutions. IVIost 

20 preferably, the desired isoform has an identical protein sequence to the 
corresponding endogenous isoform (although the coding sequence will be 
different due to codon usage). 

For transcription of a transgene in vivo (whether the desired isoform or si/dsRNA, 
25 optionally in the form of a hairpin structure), an expression construct comprising 
at least one regulatory region (e.g., promoter, enhancer, silencer, splice donor 
and acceptor and polyadenylation signal) operably linked to the DNA coding for 
the desired RNA transcript(s) may be used to transcribe the RNA strand (or 
strands). The promoter can be of almost any origin. Preferred are promoters that 
30 are active in the chosen host cells like the SV40, beta-actin, metallothionein, T7, 
polyhedrin and cytomegalovirus promoters. The promoter can be a constitutive 
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promoter, an inducible promoter or a tissue-specific promoter, for example, 
allowing inhibition to be targeted to an organ or cell type; or transcription to be 
induced upon stimulation of an environmental condition (e.g., infection, stress, 
temperature, chemical inducers); and/or engineering transcription at a 
S developmental stage or age. Alternatively, a knock-in construct can be used to 
transcribe the nucleic acid of interest under the control of an endogenous 
promoter, as is known in the art. Modified or unmodified RNA can also be 
chemically or enzymatically synthesized by manual or automated reactions as 
described above. The RNA may be synthesized by a cellular RNA polymerase or 
10 a bacteriophage RNA polymerase (e.g., T3, T7, SP6). 

Expression vectors may also include sequences allowing for their autonomous 
replication within the host organism, sequences that encode genetic traits that 
allow cells containing the vectors to be selected, and sequences that increase 
15 the efficiency with which the RNA is transcribed. Stable long-term vectors may be 
maintained as freely replicating entities by using regulatory elements of viruses. 
Cell lines may also be produced which have integrated the vector into the 
genomic DNA and in this manner the transcript(s) is/are produced on a 
continuous basis. 

20 

Thus, an expression vector can further include, if desired, additional sequences 
operably linked to a promoter, such as a reporter gene (e.g., fluorescent proteins, 
e.g., green fluorescent protein, p-galactosidase, alkaline phosphatase, iuciferase, 
CAT, selective gene markers that facilitate the selection of transformants due to 
25 the phenotypic expression of the marker gene (e.g., those expressing antibiotic 
resistance or, in the case of auxotrophic host mutants, genes which complement 
host lesions), or other useful sequences. 



30 



Nucleic acids (whether expression vectors or double-stranded RNA) can be 
introduced into a cell by various standard methods in genetic engineering, 
including physical methods, for example, simple diffusion, by injection of a 
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solution containing the nucleic acid, bombardement by particles covered by the 
nucleic add, soaking the cell or organism In a solution of the nucleic acid, or 
electroporation of cell membranes In the presence of the nucleic acid. A 
particularly preferred method for delivering nucleic acids Is the use of 
S electroporation. Altematively, a viral construct accomplishes both efficient 
introduction of an expression construct Into a cell and transcription of RNA 
encoded by the expression construct. Other methods known In the art for 
introducing nucleic acids to cells may also be used, as is apparent to the artisan, 
such as lipid-mediated delivery systems, chemical mediated transport, such as 
10 calcium phosphate transfection, DEAE-dextran transfection, and the like. The 
transfected host cells can be cultured by standard methods in cell culture. 

The methods of the present invention are particularly useful for determining the 
function of a particular isoform. In this aspect of the invention, the method further 
15 comprises Identifying a phenotype of the mammalian cell compared to when the 
desired isoform is absent, and assigning the phenotype or function to the desired 
isoform. 

The invention also provides a kit comprising reagents useful in the methods of 
20 the Invention, which may include a nucleic acid being at least a partially double- 
stranded ribonucleic acid and the double-stranded portion having at least 95% 
sequence identity to a common nucleic acid sequence shared by two or more 
isoforms of the gene product; and an expression vector encoding a desired 
Isoform of the gene product, the desired isoform having a sequence comprising 
25 one or more mismatches relative to the double-stranded portion of the nucleic 
acid, operably linked to a promoter capable of driving expression of the desired 
isoform In said cell. The preferred particulars of the components used in the 
method are those described hereinabove. 



30 



A mammalian cell exhibiting isoform-specific expression achieved by any of the 
methods of the Invention is also provided. 
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As the isoform may be used to correct aberrant isoforms, also provided is a 
metliod for treating a disease comprising administering to a subject in need of 
treatment an effective amount of a nucleic acid being at least a partially double- 

5 stranded ribonucleic acid and tiie double-stranded portion having at least 95% 
sequence identity to a common nucleic acid sequence shared by two or more 
isoforms (in particular the aberrant isoform) of the gene product; and an 
expression vector encoding a desired isoform of the gene product, the desired 
isoform having a sequence comprising one or more mismatches relative to the 

10 double-stranded portion of the nucleic acid, operably linked to a promoter 

capable of driving expression of the desired isoform in said cell. Therefore. The 
invention provides the use of the nucleic acid as hereinabove described for the 
treatment of aberrant isoform expression and for the manufacture of a 
medicament. 

15 

Administration of pharmaceutical compositions of the invention may be 
accomplished orally or parenterally. Methods of parenteral delivery include 
topical, intra-arterial (e.g. directly to the tumour), intramuscular, subcutaneous, 
intramedullary, intrathecal, intraventricular, intravenous, intraperitoneal, or 

20 intranasal administration. In addition to the active ingredients, these 

pharmaceutical compositions can contain suitable pharmaceutically acceptable 
carriers comprising excipients and other compounds that facilitate processing of 
the active compounds Into preparations which can be used phanmaceutically. 
Further details on techniques for formulation and administration can be found in 

25 the latest edition of Remington's Pharmaceutical Sciences (Maack Publishing Co, 
Hasten PA). 

Phamriaceutical compositions for oral administration can be formulated using 
pharmaceutically acceptable cam'ers well known in the art in dosages suitable for 
30 oral administration. Such carriers enable the pharmaceutical compositions to be 
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formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, 
suspensions, etc, suitable for ingestion by tlie patient 

Phamnaceutical preparations for oral use can be obtained through combination of 
5 active compounds with solid excipient, optionaliy grinding a resulting mixture, and 
processing the mixture of granules, after adding suitable additional compounds, if 
desired, to obtain tablets or dragee cores. Suitable exciplents are carbohydrate 
or protein fillers include, but are not limited to sugars, including lactose, sucrose, 
mannitol, or sorbitol; starch from corn, wheat, rice, potato, or other plants; 
10 cellulose such as methyl cellulose, hydroxypropylmethyl-cellulose, or sodium 
carboxymethylcellulose; and gums including arable and tragacanth; as well as 
proteins such as gelatin and collagen, if desired, disintegrating or solubilizing 
agents may be added, sucii as the cross-linl<ed polyvinyl pyrrolidone, agar, 
alginic acid, or a salt thereof, such as sodium alginate. 

15 

Dragee cores can be provided with suitable coatings such as concentrated sugar 
solutions, which may also contain gum arable, talc, polyvinylpyrrolidone, carbopol 
gel, polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable 
organic solvents or solvent mixtures. Dyestuffs or pigments may be added to the 
20 tablets or dragee coatings for product identification or to characterise the quantity 
of active compound (i.e. dosage). 

Pharmaceutical preparations which can be used orally include push-fit capsules 
made of gelatin, as well as soft, sealed capsules made of gelatin and a coating 
25 such as glycerol or sorbitol. Push-fit capsules can contain active ingredients 
mixed with a filler or binders such as lactose or starches, lubricants such as talc 
or magnesium stearate, and, optionally, stabilizers. In soft capsules, the active 
compounds can be dissolved or suspended in suitable liquids, such as fatty oils, 
liquid paraffin, or liquid polyethylene glycol with or without stabilizers. 

30 
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Pharmaceutical formulations for parenteral administration include aqueous 
solutions of active compounds. For injection, the pharmaceutical compositions of 
the invention may be formulated in aqueous solutions, preferably in 
physiologically compatible buffers such as Hanks's solution, Ringer's solution, or 
S physiologically buffered saline. Aqueous injection suspensions can contain 
substances which increase the viscosity of the suspension, such as sodium 
carboxymethyl cellulose, sorbitol, or dextran. Additionally, suspensions of the 
active compounds can be prepared as appropriate oily injection suspensions. 
Suitable lipophilic solvents or vehicles include fatty oils such as sesame oil, or 
10 synthetic fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. 
Optionally, the suspension can also contain suitable stabilizers or agents which 
increase the solubility of the compounds to allow for the preparation of highly 
concentrated solutions. 

15 For topical or nasal administration, penetrants appropriate to the particular barrier 
to be permeated are used In the formulation. Such penetrants are generally 
known in the art. 

The invention Is further described, for the purposes of illustration only, in the 
20 following examples. Methods of molecular genetics, protein and peptide 
biochemistry and immunology referred to but not explicitly described in this 
disclosure and examples are reported in the scientific literature and are well 
known to those skilled in the art. 

Examples 

25 

Example 1 1nhibition (knock-down) ofShcA In HeLa cells 

The signaling adaptor/scaffold protein ShcA is a member of the She family, which 
consists of three genes, ShcA, ShcB/Sli/Sck and ShcC/N-Shc/Rai (Luzi et al., 
30 2000, Curr. Opin. Genet. Dev. 10, 668-674). ShcA is ubiquitously expressed, 
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whereas ShcB and ShcC are expressed specifically in the brain (Nakamura et al., 
1998, J. Biol. Chem. 273, 6960-6967). There are three isoforms of ShcA, 
p66ShcA, p52ShcA and p46ShcA, derived from a single gene through differential 
usage of transcription initiation sites (p66 versus p52/p46) and translation start 

5 sites (p52 versus p46), which differ only in the amino terminal regions (Luzi et al., 
2000). We chose ShcA to exemplify the Invention as different cellular functions 
have been suggested for each isoform and therefore is of scientific interest to 
obtain Isoform-specific expression of ShcA. However, it will be apparent to one of 
ordinary skill In the art that the invention is not limited in any way to this particular 

10 gene family. 

The primary transcript of p52/p46 mRNA contains the entire sequence of p66 
mRNA; however, the very 5' region of p66 mRNA is present In the first intron of 
p52/p46 mRNA but is absent in the latter mRNA, having been spliced out. The 
15 p46 and p52 isoforms are derived from the same mRNA using different 

translation initiation sites. The p66-siRNA site is in the 5' region only present in 
p66 ShcA mRNA, while h/m-shc siRNA site is in the second exon which is 
present in both mRNAs. Note that the sequence of p66-shc siRNA is from a 5' 
region of human p66ShcA mRNA and Is absent in p52/46 ShcA mRNA. 

20 

Cells were transfected using the OligofectAI\4INE Reagent without siRNA or with 
h/m-shc SiRNA and luc siRNA. Briefly, HeLa cells were cultured in Dulbecco's 
Modified Eagle's Medium (DMEM, Gibco-BRL) supplemented with 10% (v/v) fetal 
calf serum (AMIMED), 0.2 mg/ml streptomycin and 50 units/ml penidllin at 37°C 
25 In a humidified CO2 (5%) incubator. One day before transfectlon with sIRNA, cells 
were plated In 6-well plates In media without antibiotics at 1 .4x10^ cells per well. 

The following 21-mer ollgoribonucleotides pairs were used as siRNA: h/m-shc 
SiRNA from nt 677-697 (in the PTB domain), 5'-CUA CUU GGU UCG GUA CAU 
30 GGG-3' (SEQ ID NO:1 ) and 5'-CAU GUA CCG AAC CAA GUA GGA-3' (SEQ ID 
N0:2). Sequences were derived from the sequence of human p66ShcA mRNA 
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(accession number. HSU7377) and its complement and each pair lias a 3' 
overhang of 2 nt on each side. Designed RNA oligonucleotides were blasted 
against Database (GEMBL) to ensure gene specificity. The RNA oligonucleotides 
were obtained from IVIicrosynth (Balgach, Switzerland). Annealing was performed 

5 according to Elbashir et al. (2001 , Nature 41 1 , 494-498). The complementary two 
strands (each at 20 ^iM) in 200 )xl of annealing buffer (100 mM potassium 
acetate, 30 mM HEPES-KOH pH 7.4, 2 mM magnesium acetate) were heated for 
1 min at 90°C and then Incubated for 1 h at 37''C. An siRNA conresponding to 
nucleotides 753-773 of the firefly luciferase mRNA (luc siRNA) was used as a 

10 negative control. 

siRNAs were introduced into HeLa cells using the OligofectAMINE Reagent (Life 
Technologies) according to the manufacturer's Instructions, with 10 of 20 |xM 
sIRNA and 3 ixl of transfectlon reagent per well. At different times after 

15 transfectlon, whole-cell extracts were prepared and analyzed by Western blotting. 
Cells were lysed in a buffer containing 120 mM NaCI, 50 mM Tris pH 8.0, and 1% 
NP40 plus Complete (Roche) protein inhibitor tablets. The whole-cell extracts (20 
yig) were analyzed by Western blotting for She A, ^tubulin and Grb2 using a 
polyclonal rabbit anti-She antibody (1:250; Transduction Laboratories), mouse 

20 monoclonal anti-Grb2 (1:1 000; Transduction Laboratories) or a mouse 

monoclonal anti-p tubulin antibody (1:1000; Sigma). We used anti-rabbit or anti- 
mouse HRP-linked antibodies from Amersham as secondary antibodies. An 
enhanced chemilumlnescence detection method (ECL; Amersham) was 
employed and the membrane was exposed to Kodak Xomat LS film. 

25 Quantification of ShcA proteins was done using ImageQuant 5.0. 

When HeLa cells were transfected with the h/m-shc siRNA, the levels of all three 
ShcA isoforms were strongly reduced 24 h after transfectlon and reached less 
than 20% of the control after 48 h and 4% after 60 h. The level of control protein 
30 p-tubulin was not affected under the conditions employed, and this was also the 
case for Grb2, a protein that specifically interacts with ShcA upon activation of 
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growth-factor signaling (Luzi et al., 2000). Luc siRNA had no effect. Time-course 
analysis showed that the levels of all ShcA isoforms remained low until the fifth 
day after transfection but started to increase thereafter. The reduction in the three 
isofonns was uniform, suggesting that both p52/p46 and p66 ShcA mRNAs were 
5 equally targeted by the siRNA. According to the Invention, repeated transfection 
maintains or re-establishes the low levels of ShcA isoforms. 

Example 2 Isoform-specifio knock-down 

10 The selective inhibition of the ShcA p66 isoform is demonstrated in this example 
using siRNA specific for p66. Cells were left untreated (control) or transfected 
without siRNA (control) or with h/m-shc siRNA, p66-shc sIRNA (from nt 236-256 
in the CH2 domain, 5'-GAA UGA GUC UCU GUC AUG GUC-3', SEQ ID NO:3; 
and 5'-CGA UGA CAG AGA CUC AUU CGG-3', SEQ ID NO:4) and control luc 

15 siRNA essentially as described in Example 1 . Whole-cell extracts were prepared 
48 h later and analyzed for ShcA and p-tubulin levels as in Example 1 . When 
cells were transfected with p66-shc sIRNA, only the p66ShcA Isoform was 
reduced with Icinetics similar to that obtained with h/m-shc siRNA; the other two 
isofonns were not affected. Similarly, p-tubulin levels were not affected. The level 

20 of p66ShcA in cells challenged a second time with the p66-shc siRNA 6 days 
after the initial transfection, when the level of p66ShcA was very low but about to 
increase, and 10 days after the initial transfection, when the level of p66ShcA 
had recovered substantially, remained low and decreased markedly again after 
transfection at days 6 and 10, respectively. 

25 

This example Illustrates the specificity of the siRNA effect with the effect of p66- 
sch SI RNA being restricted to the p66ShcA isoform. Expression of p52/p46 ShcA 
was not affected although p66 and p52/p46 mRNAs share sequence identity in 
most of the region 3' of the siRNA site, indicating that the silencing signal does 
30 not propagate to regions of mRNA 3' to the siRNA in mammalian cells. 
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In addition, the sequence of p66-shc siRNA is present in tiie primary transcripts 
of p52/p46 SlicA mRNA but is spliced out of tlie mature mRNA. Although not 
wishing to be bound by theory, the fact that only the p66 ShcA isoform was 
downregulated by p66-shc siRNA strongly suggests that the site of action of 
S siRNA is confined to the cytoplasm and to the spliced mRNA. If siRNA acted in 
the nucleus and triggered the decay of transcripts containing the corresponding 
sequence, the p52ShcA and p46ShcA proteins should have been equally 
downregulated by the same p66-shc siRNA, which was not the case. 

10 Example 3 Isoform-specific expression of ShcA 

This Example illustrates a rapid, alternative approach for isoform-specific gene 
expression. The previous Examples show how the adaptor protein ShcA can be 
suppressed in an isoform-specific manner in a human cell line. The siRNA with a 

15 sequence shared by the two ShcA transcripts suppresses p66, p52 and p46 (see 
Example 1). However, an siRNA whose sequence is present only in p66 mRNA 
suppresses only the p66 isoform (see Example 2), suggesting that the siRNA 
signal does not propagate to the 3' region of the target mRNA. In this Example, 
the expression of individual isoforms is achieved by first downregulating all 

20 isoforms by the common (h/m she) siRNA (as in Example 1) and then 

transfecting with an expression vector for the desired isoform harboring silent 
mutations at the site corresponding to the h/m-shc siRNA. This allows functional 
analylsis of individual ShcA isoforms or indeed any other gene encoding multiple 
proteins. 

25 

The full-length mouse p46, p52 and p66ShcA cDNAs were isolated from NIH3T3 
cells by reverse transcriptase-polymerase chain reaction using the sense primers 
5'-CGG AAT TCA TGG GAG CTG GGG TFT GOT ACT-3' (SEQ ID NO:5). 5'- 
CGG AAT TCA TGA ACA AGC TGA GTG GAG GCG-3' (SEQ ID NO:6) and 5'- 
30 CGG AAT TCA TGG ATC TTC TAG CCC CCA AGC CGA AGT A-3' (SEQ ID 
NO:7), respectively, and the common antisense primer 5*-CGG AAT TCA CAC 
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TTT CCG ATC CAC GGG TTG C-3' (SEQ ID NO:8). Full-length ShcA cDNAs 
were initially cloned into pBluescriptll KS-f and nucleotide sequences verified by 
the dideoxynucleotide chain termination procedure. 

5 An HA-tagged expression vector pcDNA3HA was constructed by Inserting the 
overlapping oligonucleotide pair 5"-CCC ACC ATG GCT TAG CCA TAC GAT 
GTT CCA GAT TAC GCT G-3' (SEQ ID NO:9) and 5'-AAT TCA GCG AAT TCT 
GGA ACA TCG TAT GGG TAA GCC ATG GTG GGG TAC-3' (SEQ ID NO:10) 
into the KpnI-EcoRI site of pcDNA3 (invitrogen). To construct expression vectors 

10 for HA-tagged ShcA, p46HA, p52HA and p66HA, the full-length cDNAs of p46, 
p52 and p66 were inserted into the EcoRI-EcoRV site of pcDNA3HA. ShcA 
mutants in which potential internal initiation methionine codons were converted to 
leucine codons, thus expressing only the p66ShcA or 52ShcA forms, were 
created using the Quicl<Change site-directed mutagenesis l<it (Stratagene). The 

15 overlapping oligonucleotide pair 5'-CTC CTC GAG GAC CTG AAC AAG CTG 
AGT G-3' (SEQ ID NO:1 1) and 5'-CAC TCA GCT TGT TCA GGT CCT GGA 
GGA G-3' (SEQ ID NO:12) was used to mutate methionine 65 (start site for p52) 
to leucine in p66HA, resulting in p66HA-m1. Another overlapping oligonucleotide 
pair, 5'-CCA ACG ACA AAG TCC TGG GAC CCG GGG-3' (SEQ ID NO:13)and 

20 5'-CCC CGG GTC CCA GGA CTT TGT CGT TGG-3' (SEQ ID NO:1 4), was used 
to mutate the initiation sites for p46 in both p66HA-m1 and p52HA, resulting in 
p66HA-ML and p52i-IA-ML. Silent mutations were introduced into these vectors 
at the sites corresponding to h/m-shc siRI^ as above using the overlapping 
oligonucleotide pair 5'-GGG GTT TCC TAC TTG GTC CGC TAC ATG GGT TGT 

25 C-3' (SEQ ID NO:1 5) and 5'-CAC AAC CCA TGT AGC GGA CCA AGT AGG 
AAA CCC C-3* (SEQ ID NO:16) to give p46HA-sm, p52HA-ML-sm and p66HA- 
ML-sm. Note that proteins expressed from these vectors are identical to the 
parent proteins. 

30 For isoform-specific expression, HeLa cells were first transfected with no siRNA 
(mock) or with h/m-shc sIRNA to downregulate all three endogenous ShcA 



wo 03/062414 



PCT/EP03/00611 



23 

proteins (isoforms) essentially as described above. Two days later, the cells were 
transfected with an ennpty expression vector, pCDNAS, or an expression vector 
for the desired isoform of wild-type mouse ShcA or silent mutant ShcA using 
Upofectamine 2000 (Life Technologies) according to the manufacturer's 
5 instructions. One day later, whole-cell extracts were prepared and analyzed by 
Western blotting for the ShcA expression level. Membranes were blotted with 
polyclonal anti-ShcA and anti-p tubulin antibodies. 

As already demonstrated in Example 1 , h/m-shc siRNA knocked down all three 
isoforms of endogenous ShcA almost completely. Transfection of these cells with 
mutant expression vectors for individual isoforms resulted in the expression of 
only the corresponding isoforms. Almost no protein was detected in cells 
transfected with wild-type expression vectors demonstrating that the siRNA acts 
on the exogenous ShcA expressed by the vectors. Cells which were not 
transfected with h/m-shc siRNA expressed elevated levels of ShcA isoforms, 
irrespective of the presence or absence of mutations in the expression vectors. 
In contrast, siRNA did not act on the mutated sequences allowing Isoform- 
specific expression of any She A isoform. Cells trasfected with p66HA-ML-sm 
clearly showed specific expression of only p66, whereas p46HA-sm and p52 HA- 
ML-sm resulted in expression of only p46 and p52, respectively. 

The present inventors have shown that siRNA can efficiently, specifically and 
rapidly downregulate the level of an endogenous protein in mammalian cells. The 
effect of siRNA was restricted to mRNAs containing a sequence essentially 
25 identical to the siRNA used. That the primary action of siRNA on mRNA, which is 
most likely an endonucleolytic attack, does not propagate to other regions of the 
target mRNA was inferred from the following observations: (1) the effect of p66- 
shc sIRNA was restricted to p66ShcA (Example 2)and (2) h/m-ShcA siRNA 
targeted wild-type ShcA mRNAs but not mutant mRNAs (Example 3). The 
30 second observation was with ectopically expressed ShcA mRNAs. 
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In this experiment, sequences of wild-type ShcA mRNA and mutant ShcA mRNA 
for each isoform were identical except for two nucleotides at the sIRNA 
recognition site located in the middle of the mRNA. These two nucleotide 
cahgnes were sufficient to avoid the sIRNA effect on the expression of the 
5 desired Isoform. If the silencing signal would spread either 5' or 3' of the siRNA, 
ShcA expression from both wild-type and mutant mRNAs would have been 
suppressed, as at least some of these sequences would be identical. That 
expression from wild-type mRNAs but not from mutant mRNAs was suppressed 
strongly argues for stringent specificity of siRNA-mediated mRNA decay and no 

10 propagation of RNAI In mammalian cells. This is in stark contrast to results of 

experiments in an insect cell-free system (Lipardi et al., 2001 , Cell 107, 297-307) 
and a elegans (Sijen et al., 2001 , Cell 1 07, 465-476) that have suggested that 
long dsRNA molecules synthesized by RNA-dependent RNA polymerase are 
intermediates in RNAi that amplify and maintain the effect of siRNA, implying 

15 5'spreading of the silencing signal from siRNA. If amplification is involved in 

silencing in mammalian cells, antlsense RNA oligonucleotides alone should serve 
as a primer for RNA-dependent RNA polymerase and, thus, be sufficient for gene 
downregulation. We found that antisense RNA did not induce silencing. 

20 An important implication of our results is that the combination of PTGS using 
sIRNA and the isoform-specific expression of an homologous gene with silent 
mutations causes the cell to express only one isoform, while l<eeping the levels of 
other isoforms very low. Using this knockdown-in method, it should be possible to 
examine the effect of various mutations of individual isoforms on cellular activity. 

25 The isoform sequences in the expression vectors should be designed so that 
they are not recognized by the siRNA affecting expression of the cellular genes. 
Two mismatches in the mutated expression vectors were sufficient for them to be 
free of sIRNA-mediated suppression. We used expression vectors for mouse 
ShcA in this work because there is a high degree of sequence similarity between 

30 mouse and human ShcA (90% mRNA sequences and 93% amino acid 
sequences for p66 isoform). Thus, instead of introducing mutations into 
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expression vectors, it would have been possible to design a different sIRNA with 
a sequence matching perfectly the endogenous human ShcA gene but not the 
transfected mouse gene, and achieve similar results. The advantage of our 
approach, however, Is that we can obtain isoform-speclfic ShcA expression in 
5 both human and mouse cells using the same set of probes. 

In summary, we have shown in mammalian cells that the site of action of sIRNA- 
mediated mRNA degradation Is confined to the cytoplasm and that the target 
mRNA Is restricted to those mRNAs containing a sequence essentially Identical 
10 to the sIRNA used. These specific features of sIRNA-medlated gene knock-down 
can be employed over a short time period in conjunction with specific expression 
vectors to establish conditions for expression of the ShcA gene in an isoform- 
specific manner. This knockdown-in method should be applicable and useful for 
the study of any gene expressed as multiple isoforms. 

15 

Hie disclosures of each publication referred to herein, as well as priority 
application GB 0201477.7 filed January 23, 2002, are hereby Incorporated by 
reference as if each were referred to individually. 



